Scaffold with Stochastic Gradients: New Analysis with Linear Speed-Up
Mangold, Paul, Durmus, Alain, Dieuleveut, Aymeric, Moulines, Eric
This paper proposes a novel analysis of the Scaffold algorithm, a popular method for dealing with data heterogeneity in federated learning. While its convergence in deterministic settings, where local control variates mitigate client drift, is well established, the impact of stochastic gradient updates on its performance is less understood. To address this problem, we first show that its global parameters and control variates define a Markov chain that converges to a stationary distribution in the Wasserstein distance. Leveraging this result, we prove that Scaffold achieves linear speed-up in the number of clients, up to higher-order terms in the step size. Nevertheless, our analysis reveals that Scaffold retains a higher-order bias, similar to FedAvg, that does not decrease as the number of clients increases. This highlights opportunities for developing improved stochastic federated learning algorithms.
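To make the role of the control variates concrete, here is a minimal sketch of Scaffold's client and server updates on a synthetic quadratic problem. The dimensions, step sizes, noise model, and the "option II" control-variate update are illustrative assumptions, not taken from the paper's experiments.

```python
import numpy as np

# Hypothetical toy setup: N clients, each with a local quadratic objective
# f_i(x) = 0.5 * ||A_i x - b_i||^2, with Gaussian noise added to gradients
# to mimic stochastic gradient updates.
rng = np.random.default_rng(0)
N, d, K = 10, 5, 20               # clients, dimension, local steps
eta, eta_g, T = 0.05, 1.0, 200    # local step size, server step size, rounds
A = [rng.normal(size=(8, d)) for _ in range(N)]
b = [rng.normal(size=8) for _ in range(N)]

def stoch_grad(i, y):
    """Noisy gradient of client i's quadratic loss at y."""
    return A[i].T @ (A[i] @ y - b[i]) + 0.1 * rng.normal(size=d)

x = np.zeros(d)                        # global parameters
c = np.zeros(d)                        # global control variate
c_i = [np.zeros(d) for _ in range(N)]  # local control variates

for t in range(T):
    delta_y, delta_c = np.zeros(d), np.zeros(d)
    for i in range(N):                 # full participation for simplicity
        y = x.copy()
        for _ in range(K):
            # Drift-corrected local step: subtract the local control variate,
            # add the global one.
            y -= eta * (stoch_grad(i, y) - c_i[i] + c)
        c_new = c_i[i] - c + (x - y) / (K * eta)   # "option II" update
        delta_y += (y - x) / N
        delta_c += (c_new - c_i[i]) / N
        c_i[i] = c_new
    x += eta_g * delta_y               # server aggregation of model deltas
    c += delta_c                       # server aggregation of control variates
```

The pair (x, c_1, ..., c_N) maintained across rounds is the Markov chain whose stationary behaviour the paper analyses.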
Reviews: Optimal Statistical Rates for Decentralised Non-Parametric Regression with Linear Speed-Up
Update: I have read the author response and appreciate that the authors addressed some of my comments. The focus is on obtaining statistical guarantees on generalization, which is a highly relevant direction for the growing body of work on decentralized training. The paper is generally well written, contains very original ideas, and I was very excited to read it. The main reason I did not give a higher rating is the set of limitations listed at the beginning of Sec. 5; I commend the authors for acknowledging them.
Reviews: Optimal Statistical Rates for Decentralised Non-Parametric Regression with Linear Speed-Up
This paper provides a nice and clean characterization of a decentralized learning problem. The result is perhaps unsurprising in its form, but the analysis is far from trivial. There are some nontrivial assumptions required for the results to hold, which perhaps limit the scope of the result but also suggest interesting avenues for future research in this increasingly important area. Overall, this is a solid contribution and should be of interest to NeurIPS attendees who work in optimization and distributed systems.
Optimal Statistical Rates for Decentralised Non-Parametric Regression with Linear Speed-Up
Richards, Dominic, Rebeschini, Patrick
We analyse the learning performance of Distributed Gradient Descent in the context of multi-agent decentralised non-parametric regression with the square loss function when i.i.d. samples are assigned to agents. We show that, if agents hold sufficiently many samples with respect to the network size, then Distributed Gradient Descent achieves optimal statistical rates with a number of iterations that scales, up to a threshold, with the inverse of the spectral gap of the gossip matrix divided by the number of samples owned by each agent raised to a problem-dependent power. The presence of the threshold is statistical in nature: it encodes the existence of a "big data" regime in which the number of required iterations does not depend on the network topology. In this regime, Distributed Gradient Descent achieves optimal statistical rates with the same order of iterations as gradient descent run on all the samples in the network.
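As an illustration of the iteration being analysed, the following is a minimal sketch of Distributed Gradient Descent with gossip averaging over a ring network. For brevity it uses a linear model with the square loss rather than the non-parametric (kernel) setting studied in the paper, and the network size, sample counts, and step size are hypothetical choices; only the gossip-plus-gradient structure of the update is the point.

```python
import numpy as np

# Hypothetical toy setup: n agents on a ring, each holding m i.i.d. samples
# from the same linear regression model with additive noise.
rng = np.random.default_rng(1)
n, m, d, T, eta = 8, 50, 5, 300, 0.05
w_star = rng.normal(size=d)
X = [rng.normal(size=(m, d)) for _ in range(n)]
Y = [X[i] @ w_star + 0.1 * rng.normal(size=m) for i in range(n)]

# Doubly stochastic gossip matrix W for the ring graph (Metropolis weights).
W = np.zeros((n, n))
for i in range(n):
    W[i, (i - 1) % n] = W[i, (i + 1) % n] = 1.0 / 3.0
    W[i, i] = 1.0 / 3.0

theta = np.zeros((n, d))   # one local iterate per agent
for t in range(T):
    grads = np.stack([X[i].T @ (X[i] @ theta[i] - Y[i]) / m for i in range(n)])
    # Gossip step (average with neighbours) followed by a local gradient step.
    theta = W @ theta - eta * grads

print("consensus error: ", np.linalg.norm(theta - theta.mean(0)))
print("estimation error:", np.linalg.norm(theta.mean(0) - w_star))
```

The spectral gap of W (here, of the ring) is what controls how quickly the consensus error vanishes, which is where the network topology enters the iteration counts discussed in the abstract.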